Referential integrity quality metrics
Authors
Abstract
Referential integrity is an essential global constraint in a relational database that keeps it in a complete and consistent state. In this work, we assume the database may violate referential integrity and relations may be denormalized. We propose a set of quality metrics, defined at four granularity levels (database, relation, attribute and value), that measure referential completeness and consistency. The quality metrics are computed efficiently with standard SQL queries that incorporate two query optimizations: left outer joins on foreign keys and early foreign key grouping. Experiments evaluate the proposed metrics and SQL query optimizations on real and synthetic databases, showing that they can help detect and explain referential errors.
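The two query shapes named in the abstract can be illustrated with a small sketch. It is not the paper's implementation or its exact metric definitions: the tables `city` and `customer`, the function names, and the simple counts (total rows, null foreign keys, foreign keys with no matching referenced key) are all assumptions made for illustration. The first query uses a left outer join on the foreign key; the second groups the referencing table by foreign key value before joining.

```python
import sqlite3


def build_demo():
    """Create a hypothetical in-memory database with unenforced references."""
    conn = sqlite3.connect(":memory:")
    conn.executescript("""
        CREATE TABLE city (city_id INTEGER PRIMARY KEY, name TEXT);
        -- No FOREIGN KEY constraint is declared, so invalid references can exist.
        CREATE TABLE customer (cust_id INTEGER PRIMARY KEY, city_id INTEGER);
        INSERT INTO city VALUES (1, 'A'), (2, 'B');
        INSERT INTO customer VALUES
            (1, 1), (2, 1), (3, 2), (4, NULL), (5, 9), (6, 9);
    """)
    return conn


def count_errors_join(conn):
    """Left outer join on the foreign key, one joined row per customer row."""
    row = conn.execute("""
        SELECT COUNT(*),
               SUM(CASE WHEN c.city_id IS NULL THEN 1 ELSE 0 END),
               SUM(CASE WHEN c.city_id IS NOT NULL AND k.city_id IS NULL
                        THEN 1 ELSE 0 END)
        FROM customer c
        LEFT OUTER JOIN city k ON c.city_id = k.city_id
    """).fetchone()
    return tuple(row)  # (total rows, null FKs, unmatched FKs)


def count_errors_grouped(conn):
    """Group by foreign key value first, then join the smaller grouped result."""
    row = conn.execute("""
        SELECT SUM(g.n),
               SUM(CASE WHEN g.city_id IS NULL THEN g.n ELSE 0 END),
               SUM(CASE WHEN g.city_id IS NOT NULL AND k.city_id IS NULL
                        THEN g.n ELSE 0 END)
        FROM (SELECT city_id, COUNT(*) AS n
              FROM customer GROUP BY city_id) g
        LEFT OUTER JOIN city k ON g.city_id = k.city_id
    """).fetchone()
    return tuple(row)  # same three counts as count_errors_join


if __name__ == "__main__":
    conn = build_demo()
    print(count_errors_join(conn))
    print(count_errors_grouped(conn))
```

Both functions return the same counts on this data. The grouped form joins one row per distinct foreign key value rather than one row per referencing row, which is the intuition behind grouping early when the foreign key has few distinct values.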
Similar resources
A Referential Integrity Browser for Distributed Databases
We demonstrate a program that can inspect a distributed relational database on the Internet to discover and quantify referential integrity issues for integration purposes. The program computes data quality metrics for referential integrity at four granularity levels: database, table, column and value, going from a global to a detailed view, exhibiting specific evidence about referential errors....
Investigation of Application Specific Metrics to Data Quality Assessment
Databases have risen to be one of the most important corporate assets, but usually their data quality is poor or even not manageable at all. Several metrics of data quality have been designed and implemented to monitor a database of an information system. The primary goal of data quality metrics design was to provide the managers of information centres the tools for monitoring of their database...
Referential Integrity Is Important For Databases
Referential integrity is a database constraint that ensures that references between data are indeed valid and intact. Referential integrity is a fundamental principle of database theory and arises from the notion that a database should not only store data, but should actively seek to ensure its quality. Here are some additional definitions that we found on the Web. • “Referential integrity in a...
Index Design for Enforcing Partial Referential Integrity Efficiently
Referential integrity is fundamental for data processing and data quality. The SQL standard proposes different semantics under which referential integrity can be enforced in practice. Under simple semantics, only total foreign key values must be matched by some referenced key values. Under partial semantics, total and partial foreign key values must be matched by some referenced key values. Supp...
A Study of Quality Assessment Techniques For Fused Images
Critical image processing tasks can be efficiently executed by fusion of images taken from range of distributed sensors. Advancements in digital image processing and communication technology with invent of new sensors experiencing the excessive need of effective image quality assessment of image fusion techniques. Various metrics have been discussed for quality measurement of fuse...
Journal title:
- Decision Support Systems
Volume 44
Publication date: 2008